All Questions

1 vote
1 answer
79 views

How to tune the classification threshold in a cost-sensitive manner?

I have trained a classifier outputting probabilities for each class. I want to tune the decision threshold in such a way that it accounts for different costs/gains assigned to false positives ($FP$), $...
MuhammedYunus

2 votes
4 answers
147 views

Predicting Disease Drugs

I have a dataset in the format: ...
Atom Store

4 votes
1 answer
378 views

Selecting a boundary on a binary classifier to optimize precision and recall

I have a logistic regression classifier that shows differing levels of performance for precision and recall at different probability boundaries as follows: The default threshold for the classifier to ...
Sandy Lee

0 votes
1 answer
491 views

Discrete values as target variable

I have discrete values in the target variable (exactly 13 different values in total). When I give that as input to a random forest classifier, it raises an error saying the input is continuous. And if I give ...
Chinti

2 votes
3 answers
1k views

Why is my training accuracy decreasing with higher degrees of polynomial features?

I am new to Machine Learning and started solving the Titanic Survivor problem on Kaggle. While solving the problem using Logistic Regression I used various models having polynomial features with ...
Apoorv Jain

15 votes
2 answers
28k views

How do I get the feature importance for an MLPClassifier?

I use the MLPClassifier from scikit-learn. I have about 20 features. Is there a scikit-learn method to get the feature importance? I found clf.feature_importances_ but it seems that it only exists for ...
jochen6677

1 vote
2 answers
1k views

ROC curve shows strange results for imbalanced dataset

I have a classifier trained on a heavily imbalanced dataset (1000 negative labels for each positive). I'm running a GradientBoostingClassifier with moderate success (AUC .75) but the curve has this ...
user3642173
